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ACCEPTANCE SAMPLING FOR THE NORMAL DISTRIBUTION: STANDARD 
DEVIATION UNKNOWN BUT ASSUMED CONSTANT 
by Steven M. Sidlk 
Lewis Research Center 

SUMMARY 

Standard deviation known plans can achieve a considerable reduction over standard 
deviation unknown plans in the number of sampled items required. One procedure often 
used is that of computing an estimate of the standard deviation based upon previously ac- 
cumulated data. This estimate is then used as the known value in a standard deviation 
known plan. The estimate is a random variable, however. It is shown that this causes 
the operating characteristic curve to be a random variable also and its distribution de- 
pends on the distribution of the estimate of the standard deviation. 

Since the operating characteristic curve is a random variable, it is of interest to 
consider the two problems of computing confidence limits on the curve given a random 
sample of data and of determining a sample size to achieve specified confidence limits. 
Solutions for each of these problems are presented. A numerical example is also used 
to illustrate the solution to the second problem. 


INTRODUCTION 

Variables acceptance sampling plans have been widely used in industry and govern- 
ment. This statistical quality control technique is used to maintain a minimum standard 
of quality for producers or consumers of goods. 

Assume that some item is produced, shipped, or received in lots of size M. Also 
suppose that associated with each item there is a critical variable which may be used as 
a quality characteristic. This may be some such thing as length, width, voltage, and 
so forth. 

This critical variable is most commonly assumed to be normally distributed. That 
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is, x ~ N(p, a ). This report studies the situation where p is assumed unknown and 
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constant within a lot but permitted to vary from lot to lot and a is assumed unknown 


but is constant over all lots. 

Any item which has the critical variable X > is called a defective that exceeds 
the upper specification limit X^. Likewise, if X < X L , the item is said to be a defec- 
tive falling below the lower specification limit X L> A variable may be restricted to lie 
between both an upper and a lower limit in which case we say it has double specification 
limits. 

Consider figure 1 where density functions for two normal distributions with the 
same variance but differing means are illustrated. The top graph of figure 1 shows 
x ~ There is a proportion p 2 of the items in a lot followingthis distribution 

which are defective. This proportion is called the consumers quality point and it is as- 
sumed the consumer would like the probability of accepting lots with this proportion de- 
fective or more to be less than or equal to 0. The bottom graph of figure 1 shows 
x ~ c r^. The proportion p^ is referred to as the producers quality point and it 

is assumed the producer would like the probability of accepting lots with this proportion 
defective or less to be greater than or equal to 1 - a. Most generally, a is about 0.01 
or 0.05 and 0 is about 0. 10 or 0 . 05 . The values pj and p 2 are mutually agreed upon 
by consumer and producer in some manner. 

Given a lot of M items, we wish to sample n such items randomly and decide 
whether or not too large a proportion of the lot falls outside the permissible range. 

This range may be limited by the upper limit X^, lower limit X^, or both an upper and 
lower limit. The sample size and decision procedure should be such that the producer 
and consumer are satisfied. That is, lots with a proportion defective of should be 
accepted about 100(1 - a ) percent of the time and those with proportion defective p 2 
should be accepted only about 1000 percent of the time. 

The points (p^, 1 - a) and (p 2 , 0) are two points through which the statistician would 
like to pass a curve called the operating characteristic (O. C. ) curve. This curve is ac- 
tually a function of the actual lot proportion defective and is defined as the probability of 
acceptance of a given lot with proportion defective p. It is denoted in this report as 

p a <p). 

The difference between cr-known and a-unknown plans is that, generally, much 
smaller sample sizes are needed for the cr-known plans to have them achieve essentially 
the same O. C. curve as the a-unknown plan. 

It is difficult to imagine a situation in which the standard deviation is ever really 
known. Generally what is meant by "known" is that an estimate from past data is avail- 
able. In fact, it is sometimes advocated that a a-known plan not be used unless a "good" 
estimate of a is available. See, for example, page 224 of Duncan (ref. 1). 

Duncan (ref. 1) considers sampling plans which meet the needs of both consumer and 
producer for the cases where a is assumed to be known and also where o is not as- 
sumed known. The results given in Duncan (ref. 1) are briefly summarized later and the 
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extension to the case where a is assumed unknown but constant over all lots is given. 

This report then studies the procedure whereby o-unknown plans are used for a cer- 
tain number N of lots. Then a pooled estimate of a may be made and this value used 
as if it were the true standard deviation. The effect of this procedure upon the operation 
and results of the cr-known plans is then presented. Following that, a rule for determin- 
ing an acceptable value of N is given and the whole procedure is illustrated by a hypo- 
thetical example. 

Special problems may arise in the double limit case if the standard deviation is 
large compared to the difference - X^. For simplification, further discussion will 
implicitly assume the standard deviation is not too large and all discussion will be with 
regard to the double limit case since it is slightly more general. 
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SYMBOLS 

variate 

variate 

- u ' 

critical values used in acceptance test 
number of items in a lot 
maximum standard deviation 

number of lots over which to pool estimators for & 

o 

normal distribution with mean n and variance a 
number of items randomly sampled from a lot 
sample size for cr-known plan 
sample size for cr-unknown plan 
probability of event E 

conditional probability of event E ^ given that event Eg occurred 

probability of accepting a lot with true proportion defective p 

upper confidence limit on operating characteristic curve at p^ 

lower confidence limit on operating characteristic curve at p^ 

conditional probability of accepting a lot with true proportion defective p, 
using estimate d 


lower 100 oig percent point of x^( n 

o 

upper 100(1 - ot^) percent point of XMV n 
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proportion defective 

producers quality point 

consumers quality point 

sample standard deviation 

degrees of freedom 

specific value of random variable x 

estimate of ju 

value such that if X < X L , item is considered defective 
value such that, if X > X^, item is considered defective 
random variable characterizing inspected item 

z y =*~ 1 (r) 

denotes random variable following standard normal distribution 
producers risk 
consumers risk 

mean value or expected value of x 

value of 11 such that, if x~n/ju , then observed item has probability 
p of being defective ' 

standard deviation of x 

variance of x 

estimate of a 

cumulative standard normal distribution function, i. e. , P(z <Z) = < 3 ?(Z) 
inverse cumulative standard normal distribution function 
Chi-squared distribution with r degrees of freedom 
is distributed as 
is approximately distributed as 

DISCUSSION OF ASSUMPTIONS 


It is assumed that the output of a manufacturing process is a product whose critical 
value is normally distributed. It is also assumed that the variance of the product is con- 



stant while the mean varies slowly with time. An attempt should be made to justify these 
assumptions. 

First consider the assumption that the critical variable is normally distributed. In 
many industrial situations the items are coming from a process that is "in control. " 
This term means that the major sources of variability of the product have been identified 
and eliminated. There remains an accumulation or sum of a number of small effects. 

A generalization of the central limit theorem due to Lapunov gives conditions under 
which a sum of independent random variables approximates a normal distribution (see 
p. 202 of ref. 2). A very loose statement of the theorem is that a sum of m indepen- 
dent random variables with about the same second and third moments approaches a nor- 
mal distribution as m gets large. This theorem can thus be applied to the accumula- 
tion of the small random effects to justify the normality of the critical variable. 

An empirical test for the joint normality of a group of independent samples is pro- 
vided by Wilk and Shapiro (ref. 3). This test does not assume equal variances of the 
populations nor does it assume equal means. Thus, it is an ideal test for this applica- 
tion. If normality is rejected, some other distribution and method must be used. To 
justify the assumption of equal variances, a test such as Bartlett's test for homogeneity 
of variance (pp. 642 to 643 of ref. 1) may be applied. If equality of variances is re- 
jected, the procedure described here may not be used. The or-unknown procedure should 
be used for all lots. It should also be recalled that the product is assumed to come from 
a process whose mean varies slowly with time. Thus, a procedure which uses one large 
initial sample to estimate a would have a much higher probability of giving a biased 
estimate of a than one using a series of independent small samples. There will be a 
small loss of efficiency due to the loss of one degree of freedom per sample. 


g-UNKNOWN plans 

Suppose a double-limit variables acceptance plan is required for a situation in which 
cr is unknown and X^, X^, o’, 0, p^, and Pg are specified. Let 

*->(1 - p ,) = z 1 . Pi 

and 

*-'<1 - p 2 ) = Z,. P2 

Then Duncan (p. 240, ref. 1) gives the following equations to obtain k* and n u : 
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( 1 ) 


k* = 


Z l-« Z l-p 2 + Z l-0 Z 1- Pl 
Z l-a + Z l-/3 


n u ~ 


" 1+ M! 

2 


Z l~a + Z l-j 3 
Z l- Pl - Z l-p 2y 


( 2 ) 


The procedure of the test is to observe a sample of size n u from a lot of items and 
compute 


X = 


n 


u 




u 


(3) 


s = 


u 


|l/2 


E( x i - x ^ 2 

i=l 


n u ' 1 


The lot is then accepted if 


X ~ X L 

s 




and 


x u- 


X 


& k 


(4) 


(5) 


s < MSD J 

where MSD is the maximum standard deviation as explained in Duncan (p. 246, ref. 1). 
If any of the previous three conditions fails to hold, the lot is rejected. 
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a-KNOWN PLANS 


Suppose a double-limit plan is needed when X L , a, / 3 , pj, and p 2 are speci- 
fied. 

The parameters and k are then given by 


n. 



k = 




2 


(6) 


(7) 


The operation of this plan is to observe a random sample of size and compute X as 
in equation (3). The lot is accepted if 


and 


x-x L 



cx 


> 



( 8 ) 


If either of the previous two conditions fails, the lot is rejected. 

To find the O.C. curve for this plan, let p denote the true population proportion 
defective and temporarily assume they are at the lower limit; that is, the proportion of 
defectives having X > X^ is negligible. This may be represented graphically as 



X L Mp 


X U 
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I 



The lot is accepted if 


X - X T 

L > k 

a 


Thus, 



where 


X- ii 


P 





N(0, 1) 


If the defectives are at the upper limit, that is, the proportion of defectives having 
X < X L is negligible, this may be represented graphically as 


*L 



and the lot is accepted if 


X 


V 


X 


> k 


Then 
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p a (p) 






z p - k > = 


x - u 




(-k- 



which by symmetry of the standard normal distribution becomes 


P a (p) = 


r— 

X - M, 





since 


— — ^ ~ N(0, 1) 

a /V"k 


a-ESTIMATED PLANS 

Noting equations (6) and (7), it is readily seen that knowledge of a is not necessary 
to specify the parameters of the plan. Denote the estimate of a by a. Then in place 
of equation (8) we get 


and 


X - X T 


> k 


Xy-X 


a 


>k 


Let the particular estimate d be given by 


(10) 
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a = 


(ii) 


xt n 
N u 

£ £ for *i) 2 


N(n u - 1) 


To find the O. C. curve for this plan, let p denote the true population proportion defec- 
tive and assume the defectives are at the low end. This is represented graphically as 



The lot is accepted if 


X - X T 

^ 

a 

The probability of this event depends upon the value of o and is given by 



If the defectives are at the upper end, we have the following graphic representation: 
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and the result is identical to equation (12). 

It is now seen that the conditional probability of acceptance of lots, after the esti- 
mate <7 is made and used, is a random variable whose distribution depends upon the 
distribution of a. That is, the actual probability of acceptance is unknown but constant 
for all lots following the first N lots, and for different initial samples of N lots it 
would vary. 


DETERMINATION OF CONFIDENCE LIMITS AND N 

It is well known that 

N(n - l)d 2 2 

J— ~*N(n u -D 

Thus, let 



where both square roots are assumed to be the positive values and where a fl is the 

2 2 

lower 100 x a ^ percent and bj_ a the upper 100 x (1 - a^) percent points of a 
variate. 1 U 

Then, given N, o^, and o^, upper and lower limits for the O.C. curve could be 
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calculated at any point p from equation (12) and a set of tables of the chi-square and 
normal distribution functions. 

Now the same upper and lower confidence limits are not necessarily required for 
all p. Denote upper and lower confidence limits on the O. C. curve at p = p^^ where 
i = 1, 2 by P^Pj) and P"^), respectively. 

The same confidence levels are not necessarily required for all values of p either. 
In fact, a producer has no concern over an overestimated O.C. curve at p^ and a con- 
sumer has no concern over an underestimated O.C. curve at Pg. 

Thus, let of i denote the lower confidence level desired on the lower limit of the 
O.C. curve at the point p and a 0 denote the upper confidence level desired on the 

Z , p 

upper confidence limit of the O. C. curve at the point p. 

From a set of tables of the chi-square distribution we could determine by trial and 
error a minimum N that would satisfy 


>|p;(Pi) = p a (p, l s) = a 1 - (“i, P , + “ 2 , Pi ) 


for i = 1, 2 


(13) 


If N satisfied the inequalities exactly, the situation could be represented as in fig- 
ure 2 where the dash dot curve represents the nominal O.C. curve and the dashed upper 
and lower curves denote upper and lower confidence limits. Density functions of the at- 
tained O.C. curve are represented vertically on the dashed lines at p = p^ and p = pg. 

The determination of N may be simplified if we require that N(n y - 1) > 30. This 
would most likely be true in practical situations. Then we may use the fact (ref. 4, 
p. 400) that 



1 , 


>) 


for v > 30. But we have 


N(n u - l)a 2 


*N(n u -l) 


so that 


a 

a 


|/2N(n u - 1) * n(^ 


2N(n u - 1) - 1, 


■) 


From equation (13) we get 
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P { P I< p i> s P a( p i 1 5 ) s P >1>} ' p { p ;(Pi> S P { z - ( k \ + Z Pi ) V”k | 5 } £ P a (p i>} 


■ p { p ;< p i> a p { z s (- z p, - k !) I s } s p a (p i’} 

= p { z r s ( z i- Pl -^)V^ sz i} 


where z is a unit normal variate and where 

z l = *'‘(w) 

z t = *' 1 ( p >i>) 

Let 



2 2 2 

Recalling that c o ~ N(ju , a ) implies a + bw ~ N(a + bfi,b a ), it is easily seen that 


z = z i- Pl V"k - k ; V"k - z l- Pl V"k 


" k V”k 


|/2N(n u - 1) 




= a, + b 


- y2N(n u - 1) 


where 


| y2N(n u - 1) * N^2N(n u - 1) - 1, l) 
implies z * N^a. + by2N(n u - 1) - 1, b 2 ). Hence, 



z = 


z - a i - by2N(n u - 1) - 1 


N(0, 1) 


and equation (14) may be rewritten as 



The problem is thus solved if an N can be found to satisfy all of the following si- 
multaneously: 


Zj - a^ - b|/ 2N(n u - 1) - 1 


a 


l,Pi 


Z, 3 

1-a 


- a x - b^2N (n u - 1) - 1 


a 


2, Pl 


1,P 2 


' 1-Q! 


Z- - a 2 - b'^/2N(n u - 1) - 1 


Z+ - a 2 - b^2N(n u - 1) - 1 


2 ,P 2 


(16.1) 


(16.2) 


(16.3) 


(16.4) 


To derive an approximating solution, let be the minimum N which will satisfy 
equation (16. i) where i = 1, 2, 3, or 4. These inequalities will then become approxi- 
mate equalities. Solving these for and then setting N = maxjN. j will yield the de- 
sired solution. For example, let i = 1. Then we have 


Z a |b| « Zj - a x - b^2N(n u -1) - 1 
1 >Pj 


Thus, 
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V 2 N l (n u - » 



k V n k[ 2 N l (n u ~ *> ~ *] 

V 2 N l'"u - « 


For large values of N^(n u - 1), it is seen that 

2 N l (n u " “ 2 N l( n u “ !) “ 1 

Then let 

N* = 2Nj(n u - 1) » 2N 1 (n u - 1) - 1 
and substitute this in equation (17) to get 



N 


1 


1 

2 (”u - »> 



Similar derivations beginning with equations (16. 2), 16.3), and (16.4) will yield 
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(20) 




( 21 ) 


( 22 ) 


EXAMPLE 

Suppose a consumer purchases large quantities of rivets. If a rivet is larger in 
diameter than 0. 252 centimeter, it will not fit in the standard size rivet hole and if it is 
smaller than 0. 240 centimeter it will slip through the standard size rivet hole. 

The producer of the rivets feels very strongly that his process is in very tight con- 
trol and thus he can achieve the limitations of = 0. 252 centimeter and X L = 0. 240 
centimeter very easily. The consumer is not ready to accept this on faith so he will in- 
stitute a variables acceptance sampling plan. Both agree that the diameter of rivets is 
normally distributed. 

The consumer feels that if there is much over 15 percent defective items per lot, 
his workers will be wasting too much time with defects for him to readily absorb. He 
also feels that a level of 1 percent defective is desired. Thus the consumer and pro- 
ducer mutually agree upon the following nominal figures: 

p 1 = 0.01 a = 0. 05 

p 2 = 0.15 /3 = 0.10 

Then 
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= 1.645 


Z 1-p = 4> _1 (1 - Pj) = 2.326 
Z 1 = $ _1 (1 - p 2 ) = 1.036 Z 1 _ j8 = 1. 282 


and 


k* = 


Z l-Q! Z l-p 2 


J 1- Of 


+ Z l-i3 Z l- Pl 
+ Z l-/3 


(1. 645)(1. 036) + (1, 282)(2. 326) _ 1 6Q1 
1.645 + 1.282 



! , (1.601) 2 
2 


/ 1. 645 + 1.282 \ 2 
\2.326 - 1.036/ 


11.75 as 12 


The producer and consumer also mutually agree that the actual probability of ac- 
ceptance at Pj should be no less than 0. 988 with probability 0. 999 and the actual prob- 
ability of acceptance at p 2 should be no greater than 0. 11 with probability 0. 999. Thus, 


P;( Pl ) = 0.988 


a. n = 0.001 
Pi 


p+(p 2 ) =. 0. 11 


a 0 _ =0. 001 


Only equations (19) and (22) really need be considered since an overestimate of the O.C. 
curve at pj and an underestimate at p 2 are not really undesirable consequences. The 
following quantities are needed for the calculations: 


Z 

Z 

Z 


a , “ Z 0.001 " ~ 3 - 090 

1 5 


l-a 0 „ " Z 0. 999 “ 3 - 090 


2 ^2 


.-1 


1 = ^"■'(O.gSS) = 2.257 


Thus, from equations (6) and (7) 


z Pj = z o.oi -- 2 - 326 

^^o.is '- 1 - 036 

Z+ = 4. _1 <0. 11) = -1.226 
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Then 


= /1_ .645 + 1 . 2 82 \ = 2.26 « 3 
K \2. 326 - 1.036/ 


12.326 - l - 6 - 4 - 5 ] + (l. 036 + 1,282 


k = 


VI 


Vi 


= 1.576 


N 


1 ~ 


1 

2 ( 11 ) 


(-3. 090)(1. 576 ) Vi 
2.257 - (2.326 - 1.576) Vi 


(23) 



(3. 090)(1. 576) V i 
-1.226 - (1.036 - 1. 576) Vi 


38.27 « 38 


Thus a conservative procedure is to pool information over 38 lots, estimate o, and 
then use the a-known plan. 


CONCLUSION 

A method has been presented for evaluating the effect upon the design and operating 
characteristic curve of a standard deviation a known acceptance sampling plan when in 
fact the standard deviation has been estimated from previous data. 

It was shown that the design of the standard deviation known plan is unaffected, but 
that the operating characteristic (O. C. ) curve becomes a random variable. It was then 
shown how to use an approximation based upon the normal distribution to compute the 
number of observations necessary to estimate a such that the actual O. C. curve falls 
within specified confidence limits of the nominal O. C. curve. 

Lewis Research Center, 

National Aeronautics and Space Administration, 

Cleveland, Ohio, January 27, 1971, 

129-04. 
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Operating characteristic curve 

Upper and lower confidence limits 



Figure 2 . - Possible nominal 0. C. curve with lower and upper confidence 
limits. (Possible density functions of attained 0. C. curve at p] and 
p 2 are sketched vertically. ) 
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